Global Range Encoding for Efficient Partition Elimination

نویسندگان

  • Jeremy Chen
  • Reza Sherkat
  • Mihnea Andrei
  • Heiko Gerwens
چکیده

Skipping mechanisms have been extensively studied to improve query performance over large data volumes. A powerful skipping technique for in-memory columnar databases is partition elimination. The goal is to eliminate, as much as possible, loading physically partitioned data into memory and probing column partitions against queries. This is achieved by consulting column partition summaries. The summary is often very compact compared to the column partition itself, and is kept in memory, e.g. the MinMax zone map. These summaries have been extensively integrated into modern in-memory database systems including SAP HANA [6]. In this paper, we argue that probing byMinMax range is not efficient when there are gaps in the values that appear in a column partition. Any predicate that needs to probe values in a gap inside a MinMax range naturally ends up requiring a candidate check; this reduces the benefits of column partition pruning. To address this problem, we propose a mechanism to encode each partition (likewise, query) using global ranges, carefully designed to reduce false positive rates. Our approach not only provides a compact in-memory representation, but also supports efficient partition pruning using bitwise operations. Compared toMinMax, our experiments support that our approach significantly reduces the false positive rate. It can allocate memory budget among ranges in partition groups, based on column density, estimated false positive rates from recent workload, and gaps.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Convenient and Efficient Elimination of Heavy Metals from Wastewater Using Smart Pouch with Biomaterial

A newly developed Smart Pouch with enclosed biomaterial (Aloe vera and coconut husk powder) has been experimented for elimination of heavy metals i.e. (Pb2+, Cu2+, Ni2+ and Zn2+) from wastewater. The effect of concentration, pH, temperature, contact duration etc. was investigated using batch experiments which resulted that the Pouch may be accepted for convenient, efficient and low-cost accumul...

متن کامل

Convenient and Efficient Elimination of Heavy Metals from Wastewater Using Smart Pouch with Biomaterial

A newly developed Smart Pouch with enclosed biomaterial (Aloe vera and coconut husk powder) has been experimented for elimination of heavy metals i.e. (Pb2+, Cu2+, Ni2+ and Zn2+) from wastewater. The effect of concentration, pH, temperature, contact duration etc. was investigated using batch experiments which resulted that the Pouch may be accepted for convenient, efficient and low-cost accumul...

متن کامل

k-Efficient partitions of graphs

A set $S = {u_1,u_2, ldots, u_t}$ of vertices of $G$ is an efficientdominating set if every vertex of $G$ is dominated exactly once by thevertices of $S$. Letting $U_i$ denote the set of vertices dominated by $u_i$%, we note that ${U_1, U_2, ldots U_t}$ is a partition of the vertex setof $G$ and that each $U_i$ contains the vertex $u_i$ and all the vertices atdistance~1 from it in $G$. In this ...

متن کامل

Adaptive Global Elimination Algorithm for Low Power Motion Estimation (J. Low Power Electronics 5: 1-16 (2009))

Motion estimation typically consumes 50% to 70% of total power in video encode application. Optimizing the power consumption of motion estimation process is of great importance to low power video applications. Power dissipation increases with computational complexity. Reduction in motion estimation complexity is usually associated with increase in bit rate and a loss of quality. We explore a se...

متن کامل

Edge-based Partition Coding for Fractal Image Compression

This paper presents an approach for fractal image compression that yields the best performance compared to fractal methods that do not rely on hybrid transform coding. The achievement is obtained using the standard algorithm in which the image is partitioned into non-overlapping range blocks which are approximated by corresponding larger domain blocks with image intensities that are affinely si...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018